Noise-driven Temporal Trajectory Filtering of Spectral Parameters for Robust Speech Recognition
نویسندگان
چکیده
Spectral parameter filtering such as RASTA [5], RASTA-like bandpass filtering and high-pass filtering operates on temporal dynamics of spectral parameters, and has been effective method to reduce channel distortions. Temporal derivatives (delta and delta-delta coefficients, that have proved as robust representation) and spectral mean normalization [4] are also equivalent to filtering that reject lower modulation frequencies from spectral parameters. Spectral normalization and parameter filtering assume that channel distortions are linear and additive noise is negligible [2]. As the additive noise and channel distortions are additive in different domains, they cannot be simultaneously suppressed by these methods [2]. In this paper, we extend the filtering technique of temporal trajectories to handle additive noise. The paper investigates the possibility of applying ‘spectral subtraction’ [3] filter to time-trajectories of spectral parameters and the problems that may arise.
منابع مشابه
Robust Speech Recognition Features Based on Temporal Trajectory Filtering and Non-Uniform Spectral Compression
This paper proposes a new feature extraction method based on temporal trajectory filtering and nonuniform spectral compression and examines its performance with two tasks in noisy environments. Temporal trajectory filtering is effective for robust speech recognition in noisy environments, due to human hearing is more sensitive to relative values rather than absolute values and the effect of add...
متن کاملImproving the performance of MFCC for Persian robust speech recognition
The Mel Frequency cepstral coefficients are the most widely used feature in speech recognition but they are very sensitive to noise. In this paper to achieve a satisfactorily performance in Automatic Speech Recognition (ASR) applications we introduce a noise robust new set of MFCC vector estimated through following steps. First, spectral mean normalization is a pre-processing which applies to t...
متن کاملA new feature extraction front-end for robust speech recognition using progressive histogram equalization and multi-eigenvector temporal filtering
In this paper, a new feature extraction front-end for robust speech recognition using progressive histogram equalization and multi-eigenvector temporal filtering is proposed. The progressive histogram equalization (PHEQ) performs the histogram equalization (HEQ) progressively with respect to a reference interval which moves with the present frame to be processed. The multi-eigenvector temporal ...
متن کاملRobust Speech Recognition for Adverse Environments
As the state-of-the-art speech recognizers can achieve a very high recognition rate for clean speech, the recognition performance generally degrades drastically under noisy environments. Noise-robust speech recognition has become an important task for speech recognition in adverse environments. Recent research on noise-robust speech recognition mostly focused on two directions: (1) removing the...
متن کاملSignal Trajectory Based Noise Compensation for Robust Speech Recognition
This paper presents a novel signal trajectory based noise compensation algorithm for robust speech recognition. Its performance is evaluated on the Aurora 2 database. The algorithm consists of two processing stages: 1) noise spectrum is estimated using trajectory autosegmentation and clustering, so that spectral subtraction can be performed to roughly estimate the clean speech trajectories; 2) ...
متن کامل